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ABSTRACT 



A signal processing system includes means for perform- 
ing a logic function on a multi-dimensional array of 
information stored in a memory. Typically, the memory 
stores two-dimensional video information (pixels), and 
the logic function is a discrete cosine transform (DCT), 
or other linear operation. The logic function is per- 
formed on both rows and columns of the information. 
In the prior art, this has required two memory spaces, so 
that information could be written into one memory 
while being read out of another memory. In the present 
invention, a single memory space is used to transpose 
the information between row and column format, by 
performing a read-modify-write operation on each 
memory location in a specified sequence. 
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cations. Still other signal processing techniques require 
SIGNAL PROCESSING SYSTEM HAVING that data be structured in two-dimensional data arrays, 

REDUCED MEMORY SPACE and that the data be operated on by rows and by col- 

umns. For example, 2-D filtering and 2-D transform 
This application is a continuation of application Ser. 5 operations require that information be supplied in both 
No. 07/647,325, filed on Jan. 29, 1991, now abandoned. column and row format. Applications include medical 

tomography, seismology, and radar analysis, among 

BACKGROUND OF THE INVENTION others. 



SUMMARY OF THE INVENTION 



1. Field of the Invention 

The present invention relates to a system for perform- 10 
ing operations on signals. I have invented a technique for signal processing that 

2. Description of the Prior Art transposes row and column information in a single 
The processing of two-dimensional (2-D) signals, as memory array. In this technique, the information that is 

for computer graphics or other video applications, often written into the memory array is supplied in a given 
requires storing a large quantity of information (data) in 15 format (e.g, column-wise). However, the memory array 
an array of memory cells. For example, most high defi- is read out in specified address sequences that provide 
nition television (HDTV) receiver designs require stor- the information in another format (e.g., row-wise), 
ing at least one frame of video information. In most After each memory location in the array is read out, the 
present-day HDTV designs, each frame comprises information for the next read operation is immediately 
about 1,000,000 picture elements (pixels). Considering 20 written into that location. The specified address sequen- 
that each pixel typically comprises three color elements ces may be determined using modulus arithmetic based 
that each require eight bits of brightness information, upon the dimensions of the array, 
the total number of binary bits that are required to de- DESCRIPTION OF THE DRAWINGS 

fine each frame is about 24 million. Other types of video 

displays, for example high definition graphics displays, 25 FIG. 1 shows an illustrative embodiment of a mem- 
also require a large number of bits to define each frame. ory array and address generator according to the pres- 
Therefore, the memory requirements are a significant ent invention. 

part of the cost of each such video receiver or computer FIG. 2 shows a logic diagram for determining the 
terminal. sequence of memory addresses. 

In many video designs, a linear transform operation is 30 FIG. 3 shows a typical circuit block diagram for 
performed on the information stored in the video mem- implementing the logic of FIG. 2. 
ory array. For example, to provide for a transmission FIG. 4 shows a typical prior art arrangement of two 
signal having as small a bandwidth as possible, at least memory arrays for providing information in both row 
one of the proposed HDTV techniques performs a dis- and column formats, 
crete cosine transform (DCT) on each stored frame. A 35 DETAILED DESCRIPTION 

typical system for performing the transform operation is 

shown in FIG. 4, wherein the stored information is The following detailed description relates to a signal 
divided into many blocks, each comprising H rows and processing system that includes means for transposing 
W columns (and referred to as an "HxW block")- signals between row and column format with a single 
Next, each block (40) is processed in column format 40 memory array. Referring to FIG. 1, a memory array 10 
(e.g,, col. 1, 2 . . . W) by a transform operation (41). In is illustrated wherein each of the data of a Hx W block 
order to transpose the block of information for process- occupies a unique address location inside the memory 
ing in row format, two memory arrays are typically array of the random access memory (RAM) 10. A read 
required. Information is stored in a first memory (43), or write operation is selected by line 15. Any one of the 
while simultaneously information is read out of a second 45 address locations may be accessed by supplying the 
memory (44) by rows (e,g., row 1, 2 ... H). When the address generated by the address generator (13) to the 
next block of video information that has been processed RAM (10), with a clock signal being supplied by line 14. 
by columns arrives, it will be written into the second The row and column decoding techniques, and con- 
memory (44). Simultaneously, the first memory (43) struction of the memory cells, may be accomplished 
will be read out by rows. In this manner, the informa- 50 according to principles that are well known in the art 
tion may be processed by the linear transform operation The memory cells may be of the static, or dynamic 
by both columns and rows, as required. This technique variety. There are typically multiple bits (Le., multiple 
is often called "ping-pong" in the art. memory cells) stored in each address location; how- 

In the special case wherein the memory array is ever, it is alternatively possible that only a single bit 
"square" (i.e., having an equal number of rows and 55 (single memory cell) is stored in each location. In the 
columns), it is known to transpose between column and present invention, the address generator is constructed 
. row formats by simply transposing the row and column (or controlled by software or firmware) so that the 
portions of the memory address bits. In that technique, sequence of the memory addresses provides the re- 
first the data is written into the memory array. Then, quired transposition between row and column formats, 
the address bits are transposed, and the array is read out, 60 as discussed below. 

which provides the data in the other format (e.g., row Referring to Table 1 below, an illustrative data array 
format). However, that technique does not work for (or a portion thereof, referred to as a "block" herein) 
"non-square" arrays (i.e., having an unequal number of includes six array elements. These array elements arc 
rows and columns). labelled a, b, ... f in lieu of the row and column format 

It can thus be seen that in a typical video graphics or 65 shown in FIG. 1. Note that the data array of Table 1 is 
HDTV application, large memory arrays are often re- displayed as it would relate to the spatial arrangement 
quired for transposition. It would be very advantageous of a 2-D data array (for example, pixels on a video 
to reduce the amount of memory required in such appli- screen), but that the physical locations of the memory 
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cells may be in any convenient pattern, according to the 
address decoding circuitry used. In fact, a given array 
of data may be physically stored in multiple integrated 
circuit chips, if desired. 

TABLE 1 5 

DATA ARRAY 
(COLUMN FORMAT) 



10 



25 



In the illustrative embodiment, the data arrives in 
column format. That is, each group of six incoming data 
{a, b, c, d, e, f} always arrive in a sequence that is com- 
posed of the first column (a and b), followed by the J5 
second column (c and d) and the third column (e and f), 
etc., as defined by the HDTV or other video system in 
use. Within each column (for example a and b), the data 
related to the first row arrives first (a), followed by the 
data of the second row (b), etc. After transposition, we 
desire to read the same data array in a sequence that is 
composed of the first row (a, c, e), followed by the 
second row (b, d, f), etc. This defines the row format 
read operation. In the present invention, a read-and- 
then-write (R&W) operation is performed on each 
memory address location in such a manner that the 
incoming data in one format (e.g., column format) are 
stored for readout in the desired format (e.g., row for- 
mat) of the stored data. Note that the read-and-then- 
write operation is performed on a given address loca- 
tion before proceeding to the next location. This type of 30 
operation is also referred to as "read-modify-write" in 
the memory art. 

For example, referring to Table 2 below, when initial- 
izing the memory, there is no information to be read, 
and the initial "Read & Write Sequence" may conve- 35 
niently be accomplished in the six memory locations 
labelled 0, 1, ... 5. Therefore, the first group of incom- 
ing data {Al , A2, Bl, B2, CI, C2} are stored in address 
locations 0, 1, 2, 3, 4, and 5, respectively. Table 2 shows 
contents of the memory at locations 0 through 5 after 40 
the above-noted six write operations. 

TABLE 2 



Read & Write Sequence: 



R&W Address: 0 
Write Data: Al 



1 

A2 



2 

Bl 



3 

B2 



4 5 
CI CI 



45 



In order to read out this block of stored information 
in row format, it is necessary to read the memory block 
so that the data sequence is Al, Bl, CI, A2, B2, C2, as 50 
can be seen from the general case of the data set {a, b, 
c, d, e, f} shown in Table 1. This can be accomplished 
by accessing the memory locations with the "R&W 
Address" sequence shown below in Table 3. Further- 
more, considering that the next six incoming data are 55 
labelled {Dl, D2, El, E2, Fl, F2}, (which again are 
arriving in column format), they will necessarily be 
stored (by virtue of the R&W Address sequence) as 
shown in the memory, array in Table 3: 



TABLE 3 



60 



Read & Write Sequence: 



R&W Address: 
Write Data: 



0 

Dl 



2 

D2 



4 

El 



1 

E2 



3 

Fl 



5 

F2 



65 



It can be seen that the resulting memory locations of 
these six data are not in the same order shown in Table 
2 any more. Therefore, in order to read out these six 



data in row format, Dl, El, Fl, D2, E2, and F2, they are 
read out by the read & write address sequence shown in 
Table 4 below. Furthermore, as these six data are being 
read out, the next six data are arriving (in column for- 
mat), and are labelled {Gl, G2, HI, H2, II, 12} respec- 
tively. Because they are written into the memory block 
in the same sequence as required by the read & write 
address sequence, they will be stored in the memory 
array as shown in Table 4: 

TABLE 4 

Read & Write Sequence: 



R&W Address: 
Write Data: 



0 

Gl 



4 

G2 



3 

HI 



2 

H2 



1 

II 



5 
12 



The next group of six incoming data arriving in col- 
umn format are labelled {Jl, J2, Kl, K2, LI, L2 }, 
respectively. They are treated as shown in Table 5, 
which results from reading out the previous block in 
row format, Gl, HI, II, G2, H2, 12: 



TABLE 5 



Read & Write Sequence: 



R&W Address: 0 
Write Data: Jl 



3 1 4 2 5 
J2 Kl K2 LI L2 



Finally, the next six group of six incoming data are 
labelled {Ml, M2, Nl, N2, Ol, 02 }, respectively. They 
are treated as shown in Table 6, which results from 
reading out the previous block in row format, Jl, Kl, 
LI, J2, K2, L2: 

TABLE 6 

Read & Write Sequence: 



R&W Address: 
Write Data: 



0 

Ml 



1 2 3 4 5 
M2 Nl N2 Ol .02 



It can be seen that these last six data are again ar- 
ranged in the same order as shown for the initial six data 
(Table 2). Thereafter, succeeding sequences are treated 
in the manner shown for Tables 2 through 6 above. 
Hence, information that arrives in column format is 
transposed into row format by the above procedure. 
Note that in the above procedure, the address generator 
(13 in FIG. 1) provides the proper sequences of ad- 
dresses to perform the column-to-row transformation 
shown in lie Tables. It can be seen from the above 
Tables that each sequence is an ordered set of all the 
address locations in a given memory array (or block 
portion thereof). It can also be seen that each sequence 
differs from all the preceding sequences, until the mem- 
ory array returns to its original configuration. The com- 
plete ordered set of differing sequences is referred to as 
a "cycle" herein, and the cycles repeat, as indicated. 

The above-noted exemplary procedure may be ac- 
complished on any rectangular memory array having 
dimensions of H and W, where H is the height of the 
array (i.e., number of rows), and W is the width of the 
array (Le., number of columns). The array may be 
square (H= W), or alternatively rectangular (H^W). A 
general logic sequence that provides the proper address 
sequence may be implemented using modulus arithme- 
tic, where the value of the modulus M is; 
M=HXW-1. 

This logic sequence is illustrated in FIG. 2, where: 

P is the "pointer" that defines the selected memory 
address for a given read & write operation. For exam- 
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pie, P assumes, in various sequences, the values of 0, 1 . 
. . 5 in the above illustrative embodiment. 

L is the "address incrementer", being the amount by 
which P is incremented from one read & write address 
to the next. For example, in Table 2 above, L=l, 5 
whereas in Table 3, L=2, and in Table 4, L=4, etc. 

It can be seen that the sequential value of P in the 
Tables 2 through 6 is calculated using modulus 5 arith- 
metic (M=3X2— 1). That is, when P is incremented by 
L to a value greater than 5, then 5 is subtracted from the 10 
result to detennine the new value of P. This is expressed 
arithmetically in FIG. 2 as: P=P+L mod (M). 

Note also from FIG. 2 that L itself may be calculated 
using modulus arithmetic. That is, after completing a 
given row-to-column transposition sequence (as shown 15 
in Table 2 for example), the pointer P returns to the first 
memory location (P=0), and a new value of L is calcu- 
lated as follows: L=LxH mod (M). 

For example, the initial value of L is 1 (Table 2); the 
next value of L is 1X2 (mod 5) =2 (Table 3), and the 20 
following value of L is 2X2 (mod 5)=4 (Table 4), etc. 
Note that as shown in FIG. 2, when the value of P 
returns to 0, a new value of L is calculated as described, 
and a new sequence begins. 

The foregoing logic sequence may be implemented in 25 
circuit form using a logic block diagram as shown in 
FIG. 3. As shown, the values of P, L, H and M are 
stored in registers 401, 402, 403 and 404, respectively. 
The multiplier 405 and modulus M calculator 406 are 
used to compute the next value of L. The zero detector 30 
407 signals when a given sequence (e.g., a given one of 
Tables 2 to 6 above) is complete, in which case register 
402 is updated according to the above equation. The 
adder 409 adds the values of P and L, and the modulus 
arithmetic is performed using subtracter 408 and multi- 35 
plexer 410. 

Other implementations of the logic diagram of FIG. 2 
are possible. For example, in off-line computations, 
high-level programming can be used to generate the 
addresses. AJteratively, a general purpose microproces- 40 
sor may be used to perform these calculations under the 
control of software or firmware. In the case wherein 
dedicated logic is used, as in FIG. 3, the memory array 
and address generator (including the logic of FIG. 3) 
are typically formed on the same integrated circuit chip. 45 
However, the present invention may be implemented 
using multiple integrated circuits, as when a micro- 
processor is used to generate the addresses for a multi- 
plicity of memory chips. Furthermore, while the ad- 
dress sequences may be computed using dedicated cir- 50 
cuitry that implement the modulus arithmetic, other 
techniques for generating the address sequences are 
passible. For example, a read only memory (ROM) may 
contain the necessary sequences. Since the sequences 
form a closed, repetitive cycle (i.e., the sequences repeat 55 
themselves after a finite number of 2-D data arrays have 
been transposed), they may be implemented with still 
other types of logic circuitry, all of which is included 
herein. 

While the above illustrative embodiment has shown a 60 
single transposition between row and column format, 
several such transpositions may be required in a given 
signal processing system. Hence, several memory ar- 
rays may be provided according to the inventive tech- 
nique. Furthermore, the signal information may be pro- 65 
vided in three-dimensional format, by adding a depth 
(D) dimension to the height (H) and width (W) dimen- 
sions shown. In that case, multiple transpositions may 



6 

be accomplished to construct a three-dimensional array 
out of multiple two-dimensional array. Note that the 
designation of "row" and "column" is arbitrary insofar 
as the present invention is concerned, and the transposi- 
tion may be accomplished in either direction. While the 
above illustrative embodiment has shown the general 
case of a non-square array, use of the present technique 
with a square array is also possible. Still other applica- 
tions of the present technique will be apparent to per- 
sons of skill in the art 
I claim: 

1. A signal processing system comprising: 

means for transposing a two-dimensional data array 
between row and column format, characterized in 
that said means for transposing comprises: 

an integrated circuit memory array having at least 
Hx W address locations, where H is the number of 
rows and W is the number of columns of said two- 
dimensional data array, and wherein said array is 
non-square (H^W); 

a P register for storing an address location (P); 

an L register for storing an incrementer (L); 

an address generator connected to said memory array 
for selecting each of said HxW address locations 
(P) in a repetitive cycle of sequences; 

means for performing modulus arithmetic on a given 
address location (P) in order to generate the next 
address location in the sequence according to for- 
mula: 

P={P+L} mod (M) 

wherein L is the incrementer by which P is incre- 
mented from one address location to the next, ac- 
cording to the formula: 

L-{LXH} mod (AT) 

and 

and wherein the modulus (M) is: 
M=Hx w-\- 

and further comprises means for performing a read 
operation followed by a write operation on each 
selected memory address location (P) before pro- 
ceeding to the next address location in a sequence. 

2. The signal processing system of claim 1 further 
comprising means for performing a linear transform on 
said data array by rows and by columns. 

3. The signal processing system of claim 2 wherein 
said linear transform is a discrete cosine transform. 

4. The signal processing system of claim 1 wherein 
said address generation means is located on the same 
integrated circuit as said memory array. 

5. The signal processing system of claim 1 further 
comprising video display means for displaying video 
information that is transposed by means of said memory 
array. 

6. The signal processing system of claim 5 wherein 
said video information is high definition television 
(HDTV) information. 

7. The signal processing system of claim 5 wherein 
said video information is computer graphics informa- 
tion. 

8. The signal processing system of claim 1 wherein 
said address generator is on the same integrated circuit 
as said memory array. 
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9. The signal processing system of claim 1 wherein M~Hx w-\ t 
said address generator is on a different integrated circuit 

than said memory array. and performing a read operation followed by a write 

10. A method of processing a signal, comprising the operation on each location (P) before proceeding 
step of transposing a two-dimensional data array be- 5 ^J** ne ^ ^cation m a sequence 

tween row and column format, . u ; ^ raeth °d of claim 10 wherein the step of select- 
characterized in that said transposing comprises the m * locatlon ? ^eludes performing modulus arithmetic 
steps of- for- on a gj ven location (P) in order to generate the next 

; r\. ' . A . . . . location in a sequence according to said formula, 

providing an integrated circuit memory array having n The mefl £ d Qf claim u * h£rem ^ Qf 

at least HxW memory locations where H is the , ecti locations mdudes rfonnin modulus £ thme . 

number of rows and W is the number of columns of tic Qn a ^ vcn valufi 0 f l m order to generate the next 

said two-dimensional data array, and wherein said value of L for the next seque nce in the cycle. 

array is non-square (H=^W) ; 13 ^ method of claim 10 further comprising the 

providing a P register for storing an address location ]3 step Q f performing a linear transform on said data array 

(?)>• by rows and by columns, 

providing an L register for storing an incrementer 14. The method of claim 13 whereuTsaid linear trans- 

(L); form is a discrete cosine transform, 

providing an address generator connected to said 15. The method of claim 10 wherein the steps of 

memory array for selecting each of said HxW 2 o selecting locations and providing a memory array are 

locations (P) in a repetitive cycle of sequences; accomplished on the same integrated circuit, 

providing means for performing modulus arithmetic 16. The method of claim 10 further comprising the 

on a given address location (P) in order to generate step of displaying video information processed by said 

the next address location in the sequence according transposing. 

to the formula: 25 17- The method of claim 16 wherein said video infor- 

mation is high definition television (HDTV) infonna- 
P={P+L)Omo<i{M) tion. 

18. The method of claim 16 wherein said video infor- 
wherein L is the incrementer by which P is incre- mation is computer graphics information, 
mented from one location to the next, according to 30 19. The method of claim 10 wherein said address 
the formula: generator is on the same integrated circuit as said mem- 

ory array. 

L={LxH} moditf) 20. The method of claim 10 wherein said address 

generator is on a different integrated circuit than said 
and wherein the modulus (M) is: 35 memory array. 
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